Classification of Hadiths using LVQ based on VSM Considering Words Orde
نویسندگان
چکیده
The religion of Islam is based on a sacred text called Qur‟an, a divine speech expressed in Arabic language. Qur‟an constitutes the main root of Islam jurisprudence which has a second source of inspiration known as Hadiths. As the Muslim‟s life is governed by those holy texts, need of their authenticity is required. Using VSM (Vector Space Model), we can represent Hadiths as a vector of words. The Term Weighting obtained by multiplying term frequency by the inverse document frequency does not take into account the word order, however, order of narrators is critical to classify Hadith. In this paper we propose a new method considering the words order (in our case the narrator‟s order), to classify Hadiths into four categories: Sahih, Hasan, Da‟if and Maudu‟. We use in this purpose LVQ (Learning Vector Quantization). We got good results for classifying Sahih and Maudu‟ categories. General Terms Hadith categorization, Algorithms.
منابع مشابه
Classification of Hadiths using LVQ based on VSM Considering Words Order
The religion of Islam is based on a sacred text called Qur’an, a divine speech expressed in Arabic language. Qur’an constitutes the main root of Islam jurisprudence which has a second source of inspiration known as Hadiths. As the Muslim’s life is governed by those holy texts, need of their authenticity is required. Using VSM (Vector Space Model), we can represent Hadiths as a vector of words. ...
متن کاملImam Sadegh’s (AS) Hadiths in Sunni’s lexicon
The Quran and Hadiths including Infallibles (AS) Hadiths such as Imam Sadegh (AS) were one of compilation references, and also, one of the fields of research for Arabs morphologists from long time ago. Imam Sadegh’s (AS) Hadiths based on Sunni’s lexicon, and then, based on another Islamic science books will be illustrated in this research in order to identify where these Hadiths hav...
متن کاملText categorization using topic model and ontology networks
Text categorization based on pre-defined document categories is one of the most crucial tasks in text mining applications in recent decades. Successful text categorization highly relies on the text representations generated from documents. In this paper, an innovative text categorization model, VSM_WN_TM, is presented. VSM_WN_TM is a special Vector Space Model (VSM) that incorporates word frequ...
متن کاملPrototype-based minimum classification error/generalized probabilistic descent training for various speech units
In previous work we reported high classiication rates for Learning Vector Quantization (LVQ) networks trained to classify phoneme tokens shifted in time. It has since been shown that the framework of Minimum Classiication Error (MCE) and Generalized Probabilistic Descent (GPD) can treat LVQ as a special case of a general method for gradient descent on a rigorously deened classiication loss meas...
متن کاملLyric-based Song Sentiment Classification with Sentiment Vector Space Model
Lyric-based song sentiment classification seeks to assign songs appropriate sentiment labels such as light-hearted and heavy-hearted. Four problems render vector space model (VSM)-based text classification approach ineffective: 1) Many words within song lyrics actually contribute little to sentiment; 2) Nouns and verbs used to express sentiment are ambiguous; 3) Negations and modifiers around t...
متن کامل